Biochemical Knowledge Discovery Using Inductive Logic Programming

نویسندگان

  • Stephen Muggleton
  • Ashwin Srinivasan
  • Ross D. King
  • Michael J. E. Sternberg
چکیده

Machine Learning algorithms are being increasingly used for knowledge discovery tasks. Approaches can be broadly divided by distinguishing discovery of procedural from that of declarative knowledge. Client requirements determine which of these is appropriate. This paper discusses an experimental application of machine learning in an area related to drug design. The bottleneck here is in finding appropriate constraints to reduce the large number of candidate molecules to be synthesisedand tested. Such constraints can be viewed as declarative specifications of the structural elements necessary for high medicinal activity and low toxicity. The first-order representation used within Inductive Logic Programming (ILP) provides an appropriate description language for such constraints. Within this application area knowledge accreditation requires not only a demonstration of predictive accuracy but also, and crucially, a certification of novel insight into the structural chemistry. This paper describes an experiment in which the ILP system Progol was used to obtain structural constraints associated with mutagenicity of molecules. In doing so Progol found a new indicator of mutagenicity within a subset of previously published data. This subset was already known not to be amenable to statistical regression, though its complement was adequately explained by a linear model. According to the combined accuracy/explanation criterion provided in this paper, on both subsets comparative trials show that Progol’s structurally-oriented hypotheses are preferable to those of other machine learning algorithms. 1 The results in this paper are published separately in [KMSS96, SMKS96] Book Title, Edited by Editors’ Names c 1996 John Wiley & Sons Ltd

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Parallelization of a Knowledge Discovery System with Hypergraph Representation

Knowledge discovery is a time-consuming and space intensive endeavor. By distributing such an endeavor, we can diminish both time and space. System INDED(pronounced \indeed") is an inductive implementation that performs rule discovery using the techniques of inductive logic programming and accumulates and handles knowledge using a deductive nonmonotonic reasoning engine. We present four schemes...

متن کامل

Prediction of Domain-Domain Interactions Using Inductive Logic Programming from Multiple Genome Databases

Protein domains are the building blocks of proteins, and their interactions are crucial in forming stable protein-protein interactions (PPI) and take part in many cellular processes and biochemical events. Prediction of protein domain-domain interactions (DDI) is an emerging problem in computational biology. Different from early works on DDI prediction, which exploit only a single protein datab...

متن کامل

Knowledge Discovery from Structured Mammography Reports Using Inductive Logic Programming

The development of large mammography databases provides an opportunity for knowledge discovery and data mining techniques to recognize patterns not previously appreciated. Using a database from a breast imaging practice containing patient risk factors, imaging findings, and biopsy results, we tested whether inductive logic programming (ILP) could discover interesting hypotheses that could subse...

متن کامل

Knowledge Discovery in databases - An Inductive Logic Programming Approach

The need for learning from databases has increased along with their number and size. The new eld of Knowledge Discovery in Databases (KDD) develops methods that discover relevant knowledge in very large databases. Machine learning, statistics, and database methodology contribute to this exciting eld. In this paper, the discovery of knowledge in the form of Horn clauses is described. A case stud...

متن کامل

Towards Inductive Support Logic Programming

Support logic programming and its practical implementation (Fril) integrates probabilistic and fuzzy uncertainty into logic programming using mass assignments. This paper presents a snapshot of current research, aimed at combining the best aspects of inductive logic programming with the uncertainty representation of Fril to create a sophisticated and novel approach to knowledge discovery. An ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998